DOCaWENT RESUME 



ED 056 539 



FL 002 335 



AUTHOR 

TITLE 

INSTITUTION 

PUB DATE 
NOTE 



Mansellr Philio 

Linguistic Parameters in Performance Models. 
Essex Univ. , Colchester (England) • Language 
Centre. 

Dec 70 

31p.; In Occasional Papers 8, p51-81 



EDRS PRICE MF-$0.65 HC-$3.29 __ ... 

DESCRIPTORS Articulation (Speech) ; Descriptive Linguistics; 

Environmental Influences; Language Patterns; Language 
Research; Language Universals; ♦Linguistic 
competence; ^Linguistic Performance; *Linguistic 
Theory; *Phonetics; ♦Phonology; Psycholinguistics; 
Speech; Syllables; Transformations (Language) 



This paper deals with problems concerning the nature 
of the input to a phonetic processor. Several assumptions provide the 
basis for consideration of the problem. There is a phonological level 
of processing which reflects the sound structure of the language; the 
rules associated with it are not affected by variables associated 
either with the motor system or the environment. One level of 
processing can be called motor planning, another can be seen as 
executing the motor plan, and a level of control operates over the 
execution of the motor plan. The discussion that follows these 
assumptions concerns the phonetic processing mechanism^ and raises 
larger questions concerning the interrelation of linguistic 
competence and performance. A bibliography is included. 

(Author/VM) 



o 

ERIC 



» ' > 

51 



< 7 ^ 

lTN 

o 



J.s. DEPARTriiriNTOF HEALTH. EDUCATIC 
& WELFARE 
OFFICE OF EDUCATION 
THIS DOCUMENT HAS BEEN REPRODUCEI 
EXACTLY AS RECEIVED FROMTHE PERSON 0 
ORGANIZATION ORIGINATING IT. POINTS 0 
VIEW OR OPINIONS STATED DO NOT NECES 
SARILY REPRESENT OFFICIAL OFFICE OFEDU 
CATION POSITION OR POLICY. 



O 

UlJ 



"PERMISSION TO REPRODUCE THIS COPY- 
RIGHTEE MATERIAL HAS BEEN GRANTED 



TO ERIC AND ORGANIZATIONS OPERATING 
UNDER AGREEMENTS WITH THE U S. OFFICE 
OF EDUCATION. FURTHER REPRODUCTION 
OUTSIDF: THE ERIC SYSTEM REQUIRES PER- 
MISSION OFTHE COPYRIGHT OWNER." 



Linguistic Parameters in Performance Models 



Philip Mansell 




1.0 Introduction 



4:‘- 






l/» 

(O 



I 'O 

I ’ 

i-i: 



j 

0 - 



Recent work in phonetic modelling has assumed the 
existence of previous processing; that is to say, it has 
denied the autonomy of phonetics. In particular it has 
assumed the existence of a phonology. Under current 
allophonic theory (Tatham [1969]; see also Kim [1966], 
Ladefoged [1967]) the ^phonology is to generate those 
(•extrinsic’) allophones which are viewed as part ox tP.e 
linguistic structure of the language, whereas the result 
of the phonetic processing is seen to be the realisation 
of this language-specific set of extrinsic allophones 
through a' filter, as it were, of ’intrinsic allophonic 
rules, which state universal consequences for the inter- 
section of a certain speech intention, time, and the 
mechanical apparatus of- speech. 
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The general position, then, is clear^; what is not 
clear is: 

(i) how the belief that phonetics is not autonomous 
is derived, and 

fii) the details of the form of the interface between 
the presupposed phonology and the phonetic processing. 

Now, should it be that no good case could be made 
for the belief that phonetics is dependent on prior pro- 
cessing, then what will be seen below to be the difficult 
set of problems entailed by the phonology/phonetxcs inter- 
face could be bypassed, and phonetic modelling of a 
general kind would come into line with the numerous schemes 
for rule-governed speech synthesis which assume an 
essentially phonemic input. And, indeed, I think it is 
true to say that initially allophonic theory lacked a 
formal proof of the non-autonomy of phonetics. At beot, 
it could point to the fact that, given that the study of 
performance was in some sense the study of the linguistic 
form of the utterance in interaction with the environment 
fa view that will continue to be propounded here) it was 
the linguistic form (originally identified with the Chomsky- 
Halle competence model) which was the most richly specifiea, 
and hence a good basis for experimental research. 



Recently, however, a paradigm has been defined over 
a number of speech parameters which suggests that we must 
allow for the possibility of the involvement of linguisti 
parameters at the heart of a production model for speech 
(see 2.3.3 below). This paradigm lacks as yet experiment 
validation, but is susceptible to it. Its very plausibil 
however, makes it possible to continue under the over- 
riding assumption that phonetics is not autonomous. 
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Given this assumption, we cannot bypass input problems 
nor are such problems explicitly catered for in allophonic 
theory, for, as hinted above, allophonic theory can be 
seen as stating what would be the case, given a competence 
phonology interfaced with a competence phonetics. 



This paper will attempt to demonstrate that it is 
possible, without an undue number of prior assumptions 
and without employing circular arguments to proceed towards 
an independent specification of the input, V^a a model ot 
phonetic processing derived from observation of the 
output data. , 
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The output from a particular competence phonology 
aving been found wanting in terms of this independent 
;;rci£lcatio„ of ioput necessities, the »roader question 
if whether we are in a position to generalise the result 
if the above arguments and show that the output from 
mp oLpetLce phonology would fail to meet such input 
jpecifications is discussed. 

Finally, some experimental consequences of the 
ibove discussion are pointed out. 



2 . 0 Specification of Phone tic Processej^ 
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There are, admittedly, wide areas 
amongst researchers in the 
fact about the three models of phoneti 
currently the subject of experiment [F 
Cooper [1966]; Tatham [ 197 (b]) xs their 
regard to the general tasks of a phone 
stated above, however, it is not at al 
theoretical status of these generally 
Fromkin [1966] perhaps comes closest t 
such subjects, but there is in her pap 
jump between highly general considerat 
model-building, and the specification 
of phonetic processing. 
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lacking, especially that set that, hopefully, relates 
electromyographic activity to articulation. And until a 
complete set of transformations can be derived . demonstrating 
the equivalence between these various sets of data, we 
have in principle no means of knowing whether our particular 
characterisation of the output is a suitable one for the 
purposes of making inferences about the natixre of prior 
processing . 

Given such an intractable situation, and given that 
we have no means of direct experimentation in this area, 
it is necessary to proceed with a degree of formality. 



Initially, then, I 
about levels of function 
of speech. I shall not 
assumptions, but expect 
would add that evidence 
as specifying necessary 
hard to imagine. The as 



want to make certain assumptions 
withinany schema for the production 
produce evidence for the.se 
them to be non-controversial , and 
against them - interpreted simply 
levels of functioning - would be 
sumptions are: 



(i) That there is a phonological level of p 
- that is to say, processing which reflects the 
structure of the language, the rules associated 
are not affected by variable-s associated either 
motor system or with the environment, where the 
can be defined as including the output of, say, 
centres within the brain. 
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(ii) Thar there is a level of processing which we 
can call that of motor planning,. I thus separate the 
central core of phonetic processing from the consequences 
of such processing, also within the purview of the model, 
but represented as assumption 



(iii) That there is a level of processing which can 
be seen as executing the motor plan derived under (ii) . 

(iv) That there is a level of coiitrol over the 
execution of the motor plan. 



2 . 0.2 

Of the interrelationships between these assumptions, 
it is possible to note at this stage: first, that the 

dependency of (ii) on (i) reflects the overriding 
assumption stated in the introduction, that phonetic 
processing was non- autonomous And second, and, important 
from the point of view of limiting the discussion below. 



55 



iii"> and Civl, the execution of the motor plan and the 
^“tlorover that execution, can effectively be taken « 
Yialuded in the level of motor planning for the purposes 
rmofef-iuilding, though of course antecendent to it in 
. physical system. The explanation is as follows, 
evel of motor planning can be said to ^^olud ^ 

execution of the plan, J it TeQuire 

,f the motor planning device will undoubtedly requ 

nowledee of the likely results of plans at a lower level, 

ierce tfe higher device can be said Include at east 

I partial model of the lower device. ’ ^Ises 

w-i 1 1 hp <;een as of special relevance in tho.^e cases 

!here specific action must be taken 

counteract an effect which would otherwise inevitably 
;«ur "ra ^esult of the operation of ‘»e lower d v,ce 
Further, the level of control over 

to be included in the motor planning stage, .ince we 

suppose that the form of commands 8 ^^®", of^the 

level will be constrained by the prior knowledge of the 
control to be exercised over the execution of those 
ro!:“anL given that the degree of control as proportronal 
to the amount of feedback expected. 

In effect, therefore,, the value of the discussion is 
r,ni- limited bv being restricted to levels (i) and (i ). 



2 . 1 



I want to consider now what might be inferred about 
he nLure of the processing within the phonetic component, 
iven the above limited set of assumptions, together with 
ertain non-controversial aspects of the input (aspects 
hich are also to be seen as constant over most subsets 
f the output data). Of the two examples taken, the fi^st 
f oSvious! though'not trivial, while the secon. is more 
roblematical . 



2 . 1.1 

We notice first that the output, speech, in whatever 
£ rr, A -t- i K camnled is a continuous function of time. we 
farther ioti'^Jhai. although it is possible^that certarn 
phonological features (notably that of 

have as realisations an aspect of ^ 

is nonetheless to be expected that a phonology meant to 
capture the generalities of the sound structure cf a 



language, with a terminal representation neutral 
respect to conditions o£ utterance - which is what the 
assumption above implies - will never require .rule 
environments specified in terms of measurable time. 

(This argument is further strengthened by the observation 
that whereas phonological features of lenph may be shared 
in a commLity, real time realisates of these features 
will vary widely between individuals, and 

of a particular individual generally, and under different 
cLdi?Lns in particular. This last point anticipates a 
later stage in the argument.) 

Given that the neuromuscular system is interpretative, 
we conclude that the transformation of phonological 
generalities into a form that can be realised in real time 
is a function of the motor planning device. 



2 . 1.2 

The second characteristic of th.e output I wish to 
draw attention to is the unit of organisation which is 
manifest in the output. We are accustomed to label this 
unit the syllable. 

The question to be decided is whether to regard 
syllabification as a property of the motor planner itselt, 
or as a property of the input from the phonology. 

Now it can be shown that, whatever the genesis of 
the syllable, syllabic processing is 

motor control level. For if we as . w y s ou , 

necessary for the output to manifest syl ,abic 
traditional answer might be: ' a r j 

if we further ask why memory load should need to J 
at precisely this point, the answer is surely that there 
are^motor plans which need to be made with respect to a 
considerable stretch of the utterance - these include 
plans relating to intonational phenomena, speech rate 
so on. Thus, for the purpose of such operations, one 
envisage a situation for a segmental input ”^®^®, „ 

incoming segments are fed into a series of syllable forming 
rules and are then buffered such that the suprasegmental 
timing rules can operate. 

It can further be observed from the output that the 
ostensible syllable structure of the utterance appears to 
be changed with increasing speech rate. I have not yet 
derived a means whereby phbnetic processing can be made 
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ria such as speech rate (see 
we can hypothesise that 
ersion are processes that 
fact, in any general account 
variables upon speech 
be of interest to consider the 
ariable of speech rate as 
labification rules - and this 
d in Tatham [forthcoming]. 



These arguments hold under the assumption of a 
segmental input; but for obvious reasons I am not prepared 
to admit assumptions about the form of the input into this 
argument. Reverting to the original question posed in 
this section, then, on what grounds might we suppose the 
syllable to be a phonological entity, and how would such 
a supposition affect our view of the internal workings of 
a phonetic component? 



The phonological status of the syllable has been the 
subject of debate (Kohler [1966]; Andersen [1969]; Fudge 
[1969]); it has been noted that the syllable appears to 
be an implicit notion in segmental phonologies such as 
that of Chomskv and Halle [1968]; and in their Akan phonology 
Schachter and Fromkin [1968] make explicit use of a phono- 
logical syllable. 



Obviously even a partial consideration of tne status 
of the syllable within phonology would be far beyond the 
scope of this paper, but the point is crucial for the 
reason given above, namely that I do not wish to make 
assumptions about the nature of the input. A convenient 
solution tTo this dilemma would be to show that whichever 
of the two alternatives - syllabic or segmental input - 
were chosen the effect on the form of the motor planning 
device would be minimal. If this could be shown, then it 
could further be shown that the type of input problems at 
issue later in this paper remained problems whether the 
input is specified asbeing segmental or syllabic in form. 
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Ln some measure inappropriate for the motor processing. 

4ere one would define 'inappropriate' as 'lacking infor- 
nation that we can properly demand that the phpnetics 

supply ' . 

To take the most trivial set of examples: that of the 
labelling of features. It would seem that we must buila 
into phonetic processing a series of readjustment rules 
which map phonological features into parameters of ^^cul^r 
involvement. (Note that this will be the case even if the 
phonological features are expressed in direct articulatory 
terms j Now there is no reason to suppose that the mapping 
will be one-to-one, or that the input may not be redundant 
in that involvement of the same muscle may be signalled 
twice in the couse of a single segment. Further possibiliti 
exist: the presence of a particular muscle may be signalled 
by the simultaneous presence of two or more features, 
single feature may lack realisation; or may act only as a 
conditioning environment for the realisation of another 
feature by a particular muscle. The same possibilities 
exist, of course, for the specification of the absence of 
involvement of a particular muscle in s given articulation . 

In the motor planning processing envisaged for the 
case of a segmental input, then, we have a series of 
readjustment rules, followed by syllabification rules, 
after the operation of which the string is oufferea and 
operated upon by supr as egmental rules. What would be 
the effect on this configuration of a syllabically organised 
input? Given such a minimally specified set of processes, 
of course, it is not possible to predict the effect in 
great detail. But the following points can be made: 

fij I presume that even a syllabically organised input 
will still represent segments within syllables. Hence the 
readjustment rules will still necessarily apply. 

fiil It is unlikely that the motor specifications of 
the segments will be altered by their syllabic environments 
at the stage of the readjustment rules, for these 
environments will be phonological , in nature, language- 
specific in reference, and will have been used, I infer 
from the phonology of Schachter and Fromkin, to state 
sequential constraints on the combination of segments . 

The syllable at the mo^or level, on the other hand, the 
organisation of which may well affect the individual motor 
composition of the segments entering into combination, 
are motor entities and language universal in nature. 
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It may thus be argued that acceptance of a syllabic 
input has the single consequence that we must change the 
set of syllabification rules into a set of syllable- 
deformation rules. I would suggest that this represents 
a minor modification, and that it is thus possible to 
regard the syllable/segment dichotomy as of secondary 
importance in the general framework of input problems. 



2.3 

The model outlined in the previous section is not 
complete even in very general terms; nor have we reached 
the end of what we may legitimately infer about the nature 
of phonetic processing on the basis of the initial 
assumptions and observations on the output. Even 
minimal model previously described were supplied with 
detailed rule frameworks to carry cut the functions 
postulated, its output would fail to model our data on 

real speech. 

The principal features of speech output not represented 
in the output of the minimal model are all related to the 
variability of the output, and are as follows: 



2.3.1 
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We observe that speech is 
suggests that a number of the p 
are governed by factors extern 
situation - which factors I wi 
factors, where environmental, 
is taken as including * emotion 
the apparent change in syllabi 
(see also Kozhevnikov and Chis 
also point to the rise in over 
a function of external noise 1 
and to changes in speech rats 
urgency or excitement. Perhap 
reaction to environmental fact 
whispering, which is discussed 

It is to be noted that it is rarely the case that the 
variation brought about bx jpvironmental factors is either 
simple in form or limitedVas, ex.tent; change in speech rate, 
fox example, does not re^ajilt in simple* compression or 
expansion. Q- 



al • , 

e structure with speech rate 
tovich [1965]). One could 
all speech intensity as 
evel (Lane et aZ . [1970]) 

as a function, say, of 
s a paradigm example of 
ors is the phenomenon of 
in greater detail later. 



production is seen to be variable 
it terances of what is linguistically 
m, in cases where environmental 
I*#!*#* iMt*«l #•••<' ably be appealed to. This variation 
•«» 4# ll«it d extent than the first type. 
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«M 4««lf element of phonol ogi cal ly identical 
ft* •# ttt in two different languages, 

♦# 14* verall structure of that language. 

|i ■III 4« f that under the terms of allophonic 
I4««ff •4*lf4«4 • ve the consequences of this would be 
«4« ff»4t<ll»* l4«l e same degree of mechanical co- 
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I4«lf •■■IfvMMSit li both, languages , these mechanical 
• #f t l«»l(M !••• 4#li r known as intrinsic allophones. 
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l4* 4#»*I4IIH q£ a formal proof of the non-autonomy 

• # t#f • loph onic theory now predicts that the 

!• •%l*4 ••l<N al mechanical co-articulation is 

• lt«««4 I# • I iven segment in a given environment 

*• ^•4lf•4• fMil Qjj q£ structure of the language 

I* f4«t Ml j,ton and Tatham [1970] claimed that 

lA # ^ ly four vowels with which to 

/' i / , /ae/ , /a/ and /u/ , the 
tion for the [1 ] in the sequence 
r and would more nearly approach 
co-articulation than in a language 
icluding in particular /i/, /i/ and 
3 utterance in which the C is a 
3 of two in Language 1, and the two 
Tatham [197Cb] predicts greater, co- 
anguage 2 which possesses segments 
Language 1 but where the palatal 
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2.4 

Now of these three features of the speech output, 
the fLst, the dependence of speech parameters on aspects 

f the eJilronmen?. will be seen to ^„„etlc 

reorganisation of the internal organisation of the 
reorganisatiuii reorganisation has been 

processing mechanism. When tnis r s n„rnnt the 

effected then the two other features of the output, the 

onrweU attested, the other only hypothesised at 

s?feraro:s?‘Ton^?d^era?forof;th^sfi;eit-^«^ 

fa-;eni\^ri^pu? ^LJ^urro^rone n 

fact, I shall be discussing observations (ii) C J 

in detail only at this later stage. 



2 . 4.1 

In the original assumptions of this paper, the motor 

“fy%ft^rpSro?'o^ic:r?rp:“ “rmtant 

” srt:^fprefe“^?S: :ot^oi%!^a:nfnfarr/nL- 

^h-^/^hfL-n^r-fl^^ie-orJhe-^hforftic;^^^ 

of pLformance as the intersection of the linguistic 
intention and the environment* 

Suppose for the moment that the influence 
environment is conceptualised in quite ^ 
a single control element to be introduced at a point 
the processing already hypothesised for the ^ 

ninrlPl* N ow the point requires arguing at length, but, 

in t^ms o? ;ur control elLent conceptualisation, I_can 
see nrfor;al reason for introducing ^ the xnfluence^o^ 
environment at any particular point in Je 

processing, other than either at the beginning 
end. That is to say, at any given point there will al y 

be either preceeding or subsequent processes which will 
hnth reauire explicitly stated controlling conditions, 

Ei riere-i "if 



should be placed at the end of the hypothesised 
processing; and that is that we should be 

grounds of computational “the^ 

?ule which involves backward control, or 
words, involves the resynthesis in terms of an 
criterion of what has already been synthesised, presumably 
in the absence of such a control. 



However, a criterion of e 
could take many forms, and is 
sound criterion on which to ba 
position would be immeasurably 
possible to show that, once th 
processing being governed by e 
admitted, it is impossible to 
processing adequately without 
such a system modified by such 
demonstration would show the f 
by denying the possibility of 
external constraints. I shall 
with the aid of an example. 



fficiency of computation 
therefore not perhaps a 
se this argument. The 
strengthened if it were 
e principle of phonetic 
xternal criteria had been 
model a system of phonetic 
having all the products of 
constraints . Such a 
allacy of backward control 
any computation free from 
undertake the demonstration 



2.4.2 

The example is that of whispered speech. The example 
is apt, in that the occurrence of whisper can be seen 
directly related to environmental pressures o" 
sneakers. But at the same time one is hampered by a 
paucity of data on this subject. As will be well known 
Meyer-Eppler [1957] hypothesised a number of changes in 
the spectral composition of vowels to compensate for the 
absence of a means of signalling pitch changes in 
whispered speech. The conclusion that this 
convLtional procedure has recently been questioned by 
Fonagy [1969] . Kloster-Jensen [1958] examined _ 

perception of lexical pitch variations us ing examples 
from a number of languages. The laryngeal , adjustment _ 

for whisper is well known (see e.g. van den Ber^ 

On the phonological 1 ®^®T ^ ^ igper 

some discussion on possible means of handling whisper 

w i “th in 3. plioneinic . s y s *t cm . 

These studies, however, have, dealt with only^^ 

restricted aspects of the articulation of, whispered 
speLh, and even here no clear picture has emerged. I 
shall limit my discussion of whispering in the following 
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(i) I shall assume that the phonological represen- 
tation of an utterance is invariant, whether this 
utterance is realised in voiced or whispered mode. ■ 

Evidence for this claim is given by Sharf [1964] 
that those aspects of vowel duration which are ^ dep endent 
upon the phonological specification , 

following^consonants are maintained in whispered speech. 

fii] I shall restrict the problem to a consideration 
of segments marked phonologically as t+voice] (lacking , 
any information about the articulatory configur 
the larynx during . segments marked [-voice] in 
phonology) . 

Further, as well as observing that whispering _ 
annears to be a response to environmental pressure,: . 

sSIirassLa that. i? has been observed^that tha occurrence 
of voiced sequences is very much more frequent^than the 
occurrence of whispered sequences; and hence that 
phonetically we would wish to designate. whispered speech 
Is the marked occurrence, voiced speech as the unmarked 
occurrence . 



2.4.2.!^ ^ ^ , , , 

Now, under the readjustment rules within the^phonetic 
component that were introduced above ,. let assume _ that 

the phonological feature: [+voice] is transformed^into a 

f e atur e - specifying, ^ s ay , adduo t^on^ of- the vooal •. folds ^ . •• 

We may then represent the choice between voicing a 

whispLLg as Ltween. two. modes -of .adduction o^^^ vocal 

folds, say, vibratovy 'and uncoupled; - 'Now ,-ifnwe were_ 

Ltting out a- mode contrary > to;.hypothesis.,^where^voiced 

sequences -could be re al ise d* Without ;|being su ^ 

external constraint, -then it 

that ‘the state -symbo 1 ised ~ as <-unQOup Jed woul d need 
derived from the state-'VibrditGri/ -by-an^ optional- trans- -f- 
formation, with the environment specif ied only in -this . 

trlns^orm^tibn.' such k-state of affairs 

a confusion- of criteria: for although ■ it-is observed-that 

voiced sequences areC'of/more ^ 

of ■no independent argument 'whiehii.could-- ;sh 

possibili-t'ie's i are -to -hdye -'different rank orderings-within 

the -niodel ^ wi^^ result rf 




what are not equivalent are the probabilities of 
occurrence of the presence or absence of the environmental 
constraint relating to whispered speech. 



I would contend that the only system in which we can 
satisfactorily model both the rank equivalence and the 
distinction between marked and unmarked forms wifli respect 
to the two postulated realisations of the feature of 
adduction of the vocal folds, is to model the unmarked 
term as also being subject to an environmental constraint, 
the constraint that the condition which would lead to 



an 



utterance being whispered is not present 



In order to implement, this conclusion by means of 
rule arrays, we would need to envisage If-Then rules 
which referred to the existence or non-existence of 
environmental constraints. In order to provide a formal 
definition of the marked/unmarked distinction, we should 
require that a device first access those rules referring 
to the absence of a condition. 



2.4.3 



of 



To sum up, then, I hope to have shown by means 
this example the fallacy of backward control, and hence 
to h ave justified the rejection of the insertion of 
environmental constraints, viewed as a single control 
element, either at the end or in the middle of the 
linguistically-oriented processing of the 'minimal model ' 



that the control 



It has not. been proved, of course, 
element should be placed prior to this linguistically- 
oriented processing; and in fact this proves not to be 
necessary, for h aving us ed the, single control el emen t 
conceptualisation in order to show the fallacy of back- 
ward control, it is now possible to reconsider the notion, 
and in doing so effect a radical change in the shepe of 
the phonetic processing, rather than a simple addition 
as would be the ce.se if we adopted the single control 
element conceptuaklisation. For consider what processes 
would be necessari^ if there was indeed a single control 
element before the linginstic processing. We would peed 
to bear in mind that specific environmental constraints 
might well refer to highly specific parts of the subsequent 
linguistically-oriented processing. Hence we should need 
to postulate a mechanism such as that adopted by Chomsky 
and Halle [1968] for the treatment of exceptions: the 
attachment to segment^ of farther features specifying 
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fin our case) the environmental feature which will govern 
the form of processing at some point later in Jhe 
Such a mechanism would need to be combined with sets of 
alternative rules, labelled in terms of the features ^^}®y 
state the consequences of (or having that feature specifie 
il tL If part of an If-Then rule) stored at the processing 

locations later in the model. 

The system for rule specification is only what we 
have predicted before, but the mechanism of attaching 
features to segments in order to specify the rules y 
are subject to later in the model seems an entirely 
unmotivated artifice, which has the simple effect of 
increasing the computation load, in that more complex 
matrices are produced, some containing features which wiU 
perhaps need to be carried unnecessarily through a number 
orpr^cesses before being utilised to effect a selection 
from a choice of rules. It may further be noted that 
such a system would lead to unnecessary reduplication of 
feature specification in the case of environmental 
features relating to prosodic features of the utteranc . 

The solution to this dilemma lies, I believe, in 
abandoning the form of the minimal model and its 
orLntation to the linguistic form of the utterance and 
viewing the entire processing, as a whole and in its 
separate parts as a function of environmental features. 
We^should thus allow the environmental features to enter 
the model at the point in the processing where they are 
relevant, keeping the notion of rule arrays specified i 
terms of the presence or absence of such features. ^lewe 
as a single device, the phonetic “°tor planner will_thus 
have a number of inputs, of which the^phonological form 
of the utterance will be only one. T^e°“tput from the 
device will reflect the state of activity at all these 
points . 



2 . 4.4 : \ 

This then, is the final form of the model of phonetic 
processing in terms of which I wish to discuss the optimum 
form of input. The model as it stands makes cert^.in 
demands with respect to its input, but these can be more 
clearly dembnstrated in terms of the two apparently 
contradictory observations about the extent of variation 

in the output which were introduced earlier in this 
Qft rt- i on . ' ■ 
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3.0 



The two observations regarding variation in the 
speech output not already dealt with were: 

(i) That for two different languages with identically 
specified segments at a phonological level, in identical 
phonological environments, the degree of mechanical co- 
articulation for those segments would differ according to 
the total sound pattern of those languages. 

(ii) That for a single speaker on a given occasiona, 
repetitions of a phonologically identical utterance will 
not have the same physical realisations. 

(i) , it will be remembered, has the status of a 
hypothesis, while (ii) is well attested in a number of 
subsets of output data (see Mansell [1970] and the 
references there). It is interesting, therefore, that 
both types of variation can be handled by the same 
mechanism. 



3.1.0 

The hypothesis concerning differing degrees of co- 
articulation requires that information related to the 
structure of the language be' available at the level 
phonetic processing. The information contained in a single 
column of a distinctive feature matrix could be said to 
contain information not only about its own identity , but 
also, by inference from the labels of the rows of the 
matrix and the binary or integer notation in the cells 
of the matrix, information about all other segments. 

These 'other segments' would however be all other segments 
describable in terms of that particular matrix, irrespective 
of the pattern of that particular language. And even could 
we first derive inferential rules and second constrain 
such rules to produce the desired information about a 
particular language, it is doubtful whether such information 
would be of great practical importance in modelling this 
aspect of the output in terms of a model such as that 
described in the previous section. 

For in such an environment-doi^inated mpdql, the 
linguistic information that is to modify the ^motor pattern 
must be expressed in terms of the. aommunioabil'C'py of the 
sound pattern of a language. In short, we require fron 
the phonology information relating to the relative 
importance of segments anc^i”^|^ts of segments in the sound 
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tructure of the language, information which is to be 
ised in the ■phonetics to provide answers to such questions 
LS : given this communicative situation (i.e. the totality 

if environmental pressures) what are the features which 
rill enable me to distinguish this segment from others 
not necessarily all others) in the language. It is 
Interesting that Morton aud Tatham (op.eit.) and Tatham 
op.oit.') introduce the psychological principle of ^ 
iaximum differentiation to explain the overriding of 
latural co-articulatory tendencies that they hypothesise, 
(fhat a phonetic model as envisaged here must demand from 
Lts input is a formal analogue of just such » principle. 

This requirement in effect is a collapse of 
separate requirements: the first is that the labelling 

if the rows of the matrix, to put the problem in purely 
graphic terms, relate to the communication of the 
structure of the language. About this requirement I have 
lothing substantive to sayj the details of the re- 
adjustment rules postulated within the phonetic component 
are as yet so little worked out it is impossible to say 
whether an intermediate level between classif icatory and 
motor features (which is what in effect features relating 
to communication would be) is in fact necessary. The 
second requirement relates to the need for knowledge 
about the motor output relevant to a particular environ- 
mental load: in other words it requires hierarchical 

structuring within the columns of input matrices, the 
hierarchy to provide a rank ordering of features relative 
to their necessity of inclusion in the output in order 
that a particular distinction be maintained. 



3.1.1 

Turning now to the other instance of variation, that 
between successive utterances of phonological ly identical 
items, experimental evidence appears to show that this _ 
variation takes place on only a limited scale (though this 
may be a false impression induced by the fact that 
variation has not generally been investigated for its 
own sake, but has been reported on as a result of 
specialist investigations) . Thus, on the single parameter 
of lip protrusion for the vowel ['ii] Mansell and Allen 
[1970] reported preliminary evidence that successive 
articulations differed quite widely, although the 
phonological context was constant, and the subject was 
instructed to keep his utterances as similar as possible. 
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It is interesting to note that the subject was able, as 
a result of this instruction, to keep both duratlsji and 
amplitude constant; and it is further of note that 
spectrographic evidence did not show discernible spectral 
variations with the variations in lip protrusion. 

It would seem that, in order to explain data of this 
nature, we are again led to the concept of hierarchically 
ordered features within segments. For, just as it was 
argued that such ordering could provide an index of what 
was necessary for communication, so it can be argued that 
it can provide an index of what is noi necessary. In 
particular it was proposed (Mansell [1970]) that some 
features of segments could be viewed as being at such a 
low level in the communicative hierarchy that their value 
within a particular language pattern could be said to be 
almost negligible. Control over their production would 
therefore be at a minimum, and the value reached on some 
particular measurement parameter applied to the output 
would be in almost perpetual oscillation. 



3.2 

It is evident that the recj^uirements upon an input 
stated above are not met by the output from a Chomsky/ 
Halle phonology. It will be noticed that the inadequacy 
has not been demonstrated in terms of the basic form of 
the input - segmental or syllabic - nor even in terms of 
its matrix representation - viewing the task of phonetic 
processing as eliminating the rows and columns of a matrix 
can be a formal . analogue of the dual tasks of converting 
linguistic information to motor information and adding 
measurable time to the representation. Rather the 
objection has been to the terms of reference of such a 
putative input; and it has been suggested that the 
ref erence should principally be to the communicative 
function of the language structure, such a reference ^ 
being made principally by means of hierarchical ordering 
within segments. 

What is required in fact is a performance phonology 
organised in such terms; but lacking such a phonology it 
will be necessary to continue the process of inference 
from speech output data until we have a more viable and 
complete specification of the input data required. I 
hope in this section and the last to have shown that 
such an undertaking is possible, and, in that it leads to 
generalisations over a number^ of aspects of the output 
ah 1 e . • ^ 
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4.0 



The question could be ie 
as been made of a performanc 
hat it is possible to add to 
ispects of the competence/per 
(articular, I wish to see how 
ibove result in order to show 
:ompetence phonology (and sin 
IS a technical term this must 
jhonology) would satisfy the 
jhonetic component. 



ft at this point; but mention 
e phonology, and I believe 
the discussion by considering 
formance dichotomy. In 
far one can generalise the 
that no output from any 
ce we are using competence 
mean any variety of TG 
input requirements of, a 



4.1 



With regard to the first question, it does not 
ppear possible at this stage to ^ 

articular conclusion given in Section 3. A number of 
loints may be made, however, which, it is hoped, will 
;how those areas where we lack the knowledge that .alone 
rould make a decision on this q"®stion possible. It is 

;o be noted that the conclusions that Watt [1970] ha.^ 

•ecently reached in this area rest on t e asis o a 
lirect comparison between experimental evidence and 
iredictions of complexity derived from competence model, 
rhese conclusions were, briefly, that ^,®°“P®?®"®® . 

irammar, as defined by the theory, could^noc in any 
Its conceivable forms, be either included in 
rfith the mental grammar that a hearer employs to decide 
syntactic material. But that the sub-optimal grammar of 
the adult was nonetheless a grammar. 



Perhaps the best means of approaching^ this question 
s to consider a performative schema for tneproducuxuu 
f speech which will include processing in those areas 
e conventionally designate phonology and phonetics. It 
s possible then to ask. like Watt, whether the assumption 
hat a competence model (in this case the phonology) is 
o be included in a performance model (the schema) is 
alid. Chomsky's statement of this assumption is well 
;nown and reads as follows: 






. 



No doubt, a reasonable model of language use will 
incorporate as a basic component, the generative 
grammar that expresses the speaker-hearer's knowledge 
of the language; but the generative grammar does hot, 
in itself, prescribe the character or functioning of 
a perceptual model or a model of speech production. 

( Aspects , p.9) 

Now, it is evident that this basic assumption could 
be made a good deal weaker if it were possible to show 
that a competence model doss, in itself, constrain the 
character and functioning of a performance model. It 
seems to me that the rudiments of such a demonstration do 
exist, although lack of information does not enable us to 
carry the argument through. Here we need to make use of 
the descriptive term frequently applied to competence models 
- that they are monol'tth'io in form. This means in 
practice that the information stated in a competence model 
can only be accessed en masse (and this effectively means 
at the start of a performance model) or in blocks according 
to the natural breaks in the model. Now, it has already 
been seen that accepting the competence model as input to 
the performance gives rise to problems as to the specificatK 
of this input. For a convincing argument it would need to 
be shown that such input problems were ineradicable given 
the form of the competence model, and that the alternative, 
dispersion in blocks, was at the least unmotivated. 

Hence a proof of the generic unsuitability of 
competence models within performance would have to show: 

(i) That the natural breaks in such models are of no 
behavioural significance, 

(ii) That the form of the output at present employed 
is necessary to the operation of the present and any future 
competence phonology, 

(iii) That tne introduction of such information as a 
phonetic component might demand into a competence phonology 
would not be justifiable in terms of the general constraints 
that such competence models are subject to. (This last 
being closest to the types of arguments used by Watt [op 
ait . ]■.') 




It is obvious, I think, that we are in ho position to 
demonstrate either (i) or (li). 

. OA- 
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The only theorist to my knowledge to have considered 
the behavioural relevance of the breaks in the phonological 
model is Halle [1962] (though Kiparsky [1968a, 1968b] has 
made claims for the behavioural relevance of other aspects 
of phonological formalism) , who uses the divisions of a 
competence phonology in the process of discussing an 
'intelligibility criterion' for the introduction of 
innovations into the sound patterns of languages. He 
writes : 

I am unable at present to characterize the place in 
the order where rules may be added with a minimum 
impairment of intelligibility* Such additions, 
however, seem characteristically to occur at points 
w he re t he re a re natural breaks in the grammar. 

I know of no claims to the contrary, or of further claims 
denying the behavioural relevance of breaks in the 
grammar. 

Farther, (ii) seems implausible. Wilson [1966] has 
specific arguments that aspects of generative phonology 
are not functions of the adoption of distinctive feature 
matrices, as Halle [1962] appears at one point to claim, 
but have a quite separate basis for validity in general 
linguistic theory.- And in principle, I can see no reason 
why a phonology should be constrained to a particular 
phonetic representation, except in terms of (iii) below. 

I -t is interesting to note in this context that Chomsky 
[1964, p.l20] proposed the adoption of a universal phonetic 
alphabet for the very utilitarian purpose of satisfying 
the requirement that a grammar should have a set of out- 
put sentences. 

(iii) would appear, then, to be the most profitable 
line of investigation here; and it indeed seems possible 
that a cemoetence phonology could not be found that could 
justify by* internal criteria the representation of such 
information as the phonetic processing appears to require. 

I am not at present prepared, however, to use the 
relatively meagre set of input criteria derived for 
phonetics for this purpose. 
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tl 4* «#•••• ory to conclude this discussion on 

• » 4 l 4 • however. It is the case that we 

»««4 «• t *l4«<»'..ic behaviour on a level beyond that 

mt 4444»l4«4t /ioural acts; what is becoming clear, 

<*» -n psycholinguistics (see Watt, op o^t . ; 
t»4«r C*' ^tt [1966]; Vandamme [1968]) is that 

» ^ " ,;odels are particular constructs within 

theory which are not necessarily related to 
task of modelling performance. Watt [op oit.] has 
derived an Abstract Performative Grammar on the basis of 
his experimental evidence. It is to be hoped that 
further attempts to specify the nature of the input to 
the phonetic component, and to systematise the form of 
the rules within that component, may eventually lead to 
a similar solution in phonetic modelling. 



5.0 

Very briefly, those areas of the above discussion 
which require experimental evidence with some urgency are 
those of alternation , variation and oo-articulatzon . 



5.1 



The treatment of alternation in the. model has been 
investigated by means of the example of whispered speech. 

It is imperative in order to add substance to this 
treatment, that an investigation of articulatory activity 
during whispered speech b^;^arried out. It is suggested 
that only with a knowledge- of articulatory parameters can 
we hope to approach the question of compensatory variations 
in the acoustic output with any degree of success. 




The study of the variations between successive 
utterances of phonological ly identical tokens is already 
J^p^ogreL! U was slimulaied initially by an investigation 
of various unsatisfactory features of electromyographic 
technique in speech research (see Mansell |^970] and 
Mansell and Allen [1970] for details). This 
continuing and has received an additional impetus from 
the theoretical interest of the topic. 



5.3 



A large part of the argument above rests on a 
hypothesis concerning extent of co-articulation in 
different languages. Experimental validation of this 
hypothesis must wait on the setting up of adequate 
parameters to measure co- articul ation . The ° . 

Ohman on the tongue has perhaps constrained us 

into thinking of co-articulation as being a predominantly 

two- dimentional phenomenon . 



6.0 



Conclusion 

This paper has dealt primarily with problems 
concerning the nature of the input to a phonetic processor, 
various criteria for such an input relating to the 
oommunioation of the sound structure of a were 

derived as a result of a specification or that phonetic 
processor on the basis of a limited number of assumptions 
linked to inferences from speech data. 

It was further suggested that such procedures were of 
theoretical as well as practical relevance in that they 
make it possible to raise larger questions concerning the 
interrelation of competence and performance. 



Notes 



iClear, that is, if either one of two simplifying 
assumptions is made : 

ril That the set of 'extrinsic allophonic rules' is 
coextensive with the set of phonological rules, given in 
some phonology. 

(ii) That the notion 'the set of extrinsic allophones 
is defined simply as coextensive with the given set of 
outputs from a presupposed but not necessarily characterised 

phonology . 

Assumption (ii) may be held, of course whatever 
content we might wish to give to the notion extrinsic 
allophone' (and it will be seen from the text that the 
view preferred here does not correspond to the output ot 
any known phonology); though this assumption may be a 
convenient one for this reason, it is evident that any 
characterisation of the notion 'extrinsic allophone would 
sain in interest if constrained from two directions , by 
the nature of both phonological and phonetic processing. 

The adoption of Assumption (i) may be justified by 
the emergence of seme future phonology; for the present 
the testing of this assumption against an existent 
phonology is interesting, not necessarily for the nature 
of the result, but for the purpose of deriving the grounds 
on which we might wish to partition a set of phonological 
rules. For one such attempt and attendant reasoning, 
see Postal [1968] (Universal Detail Rules, p.66 f.). 

These questions are not explicitly discussed in the 
body of the text, but would be of great importance in a 
wider treatment of the phonology/phonetics interface. 



2i am here regarding phonological matrices as capable of 
being transformed only into matrices where obligatory 
presence, or absence, or neutrality is shown for particular 
muscles or muscle groups. Given that all information 
relevant to motor planning is extracted from the input 
string. I would wish the specification of amount ot 
muscle contraction in a given instance to be a subsequent, 
purely motor, decision.. In this view, the phonological 
specification by means of integers of degrees of features 
(with the exception, of course, of stress) would only 
necessary in order to change which muscles were selected. 
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This argument does not hold for the Lugisu phonology 
;iven by Brown [1969]. She writes (p.6]: 

The syllable we are concerned with here is not ... 
an unLrlyihg d i st r i but i ona 1 un 1 1 , ^ P ' 

syllable, but the distributional unit of the 
phonetic real ization. 

is beyond the scope of this paper, however, to discuss 
“ “ “ ‘ p r opos al s 



he details of Brown's 



rhe Question to be decided experimentally is whether we 
an represent the whispered mode of speech as differing 

eed tfconsider it to be an entirely separate programme 
n the phonetic level. 

This issue is basically that which is examined with 
espect to the communication of pitch ^ 

-hisper in Meyer-Eppler [1957] and a 

extl The same problem, however, presents itseit in 

ZlVsiJvler for/when we co.e 

'I :rifed aid'^Mspe^ed «res® If we were ti hypothesise 
local defamation of the total 
utterance in the case of whispered speech, then we 

irr: EiidS- 

whispered speech would be avoided. Hence we snouiu 
expect to find no abduction manoeuvre of the glottis 
during voiceless stops. 

of the phonetic component. That there should be a 
separate strategy, involving the excision of redundancy 
and the employment of .^xtra whisper-specific manoeuvres. 
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would fulfil a criterion of simplicity just for the 
limited case of whispered speech. 

It will be seen then that experimentation concerned 
with glottal manoeuvres during speech is of quite 
general relevance. Such experimentation is at present 
under preparation in this laboratory. 



^The formulation requires further explication for the 
following reasons: 

(i) The notion 'rank ordering' is imprecise. 

(ii) The rejected device of deriving the marked 
form from the unmarked by means of an optional trans- 
formation is an approach conventionally used in the 
syntactic component of a generative grammar. 



In the case where Z is the marked realisation of Y, 
and A the unmarked realisation, suppose the alternation 
rules: 



1. If X, then Y-^-Z 

2 . If X, then Y-^A' 

Suppose further that these rules have derivational 
consequences : 

3. If Z, then (Z^, Z2, Z3, ...» Zn) 

4. If A, then (A^, A2 » A3, ...» Am) 



where Z\, Z2 etc. and Ai , A2 etc. r.re steps in the 
derivation of Z and A respectively, and Zn and Am belong 
to the terminal alphabet of the derivation, and m<n. 



Then the differences in complexity between rules 
{1,3} and rules{2,4} lie either in the 1/ parts of 
or in the Ihen parts of { 3 , 4 } or both. I have shown the 
case where the marked realisation is derivational ly more 
complex: this would be conventionally held to be a 

justification for the choice of Z as the marked term. 

But if we were to graph the comparative derivational 
history of the two forms, say by entering on a tree 
diagram the results of successive operations, it would 
be obviously seen that the non-terminal categories 
and Z would have precisely the s me degree of derivational 
complexity intervening between them and the super- 
ordinate category V - in this case they would both be 
immediately dominated by 'I . It is claimed that this ^ 
equivalence of rank ordering of the alternate realisatio 
of a superordinate category is in principle distinct from 
the non-equivalence of the derivational complexity 
intervening between A and Z and Am and Zn* 
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(A brief note on the rules above: For the special 
case of C2) where A=Y, and hence where for (4) Am=0, 
however, there remains a formal problem. Either (2j is 
to be regarded as redundant, or is to be allowed to apply 
vacuously. In the first interpretation, the marked form 
is derived from the unmarked form by default. For 
opposing views on this point, see Lamb [1964] and Postal 

[1968].] 



Turning now to the case of syntax, it is necessary 
to examine the status of optional- transformations . We 
can distinguish between a rule system for the enmeration 
of sentences (a competence grammar], and a pass through 
these rules (a schematic, though scarcely adequate, account 
cf performance]. In the latter case, the existence of an 
optional transformation constitutes a hypothesis about 
language behaviour - namely that the variation is random 
in nature. If random behaviour does not appear 
appropriate, and conditions are attached to the alternation, 
then the optional nature of the process is destroyed. 

In the case of the enumerative system, I would claim 
that the existence of optional transformations shows the 
limit of applicability of the categories defined in the 
system. Evidently, there is a form of trading relationship 
between the number of categories defined in the system 
Cand hence the number available for the statement of rule 
applicability contexts] and the number of situations which 
can be defined within that system. (The total 
situations may be less than the number which could be 
defined given the number of categories, according to 
the operation of the evaluation measures chosen, of course; 
indeed, we may expect to find optional transformations in 
a system where the number of categories is optimal with 
respect to a particular evaluation measure for the system 
as a whole. What is in question here is not the 
satisfactory or unsatisfactory nature of the devices ot 
particular competence systems, but whether these devices 
have relevance for the formalisms of a phonetic component.] 
To take only one example: it appears to be the case that 
were syntactic theory to be extended to include 
categories relating to discourse phenomena (Topic, Comment, 
Focus, Presupposition, etc.] then the optional status of 
certain transformations (notably the Pseudo-cleft; see 
Kay [1967] p.9] would be no ^^er , def ensib 1 e in certain 
situations defined by these categories. 



In phonetics one is hoping to specify a performance 
model of much greater complexity than the 'pass through 
the enumerative rules' referred to above; even in the 
present initial stages, it seems unlikely that random 



variation will play much, part in phonetic explanations. 
Further, the categories of phonetics are at present far 
from being defined or even listed. The relevance, there- 
fore, of a schema for handling alternation which appears 
to be a function of a particular limited set of categories 
is disputed. 
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